首页> 外文OA文献 >On the stability of some controlled Markov chains and its applications to stochastic approximation with Markovian dynamic
【2h】

On the stability of some controlled Markov chains and its applications to stochastic approximation with Markovian dynamic

机译:关于一些受控马尔可夫链的稳定性及其应用   马尔可夫动态的随机逼近

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

We develop a practical approach to establish the stability, that is, therecurrence in a given set, of a large class of controlled Markov chains. Theseprocesses arise in various areas of applied science and encompass importantnumerical methods. We show in particular how individual Lyapunov functions andassociated drift conditions for the parametrized family of Markov transitionprobabilities and the parameter update can be combined to form Lyapunovfunctions for the joint process, leading to the proof of the desired stabilityproperty. Of particular interest is the fact that the approach applies even insituations where the two components of the process present a time-scaleseparation, which is a crucial feature of practical situations. We then move onto show how such a recurrence property can be used in the context of stochasticapproximation in order to prove the convergence of the parameter sequence,including in the situation where the so-called stepsize is adaptively tuned. Wefinally show that the results apply to various algorithms of interest incomputational statistics and cognate areas.
机译:我们开发一种实用的方法来建立一类受控马氏链的稳定性,即在给定的集合中出现。这些过程出现在应用科学的各个领域,并且包含重要的数值方法。我们特别展示了如何针对参数化的马尔可夫转移概率族和参数更新组合单独的Lyapunov函数和相关的漂移条件,以形成联合过程的Lyapunov函数,从而证明了所需的稳定性。特别令人感兴趣的是,该方法甚至适用于过程的两个组成部分都存在时间刻度分离的场合,这是实际情况的关键特征。然后,我们继续展示如何在随机逼近的上下文中使用这种递归属性,以证明参数序列的收敛性,包括在所谓的逐步调整得到自适应调整的情况下。我们最终证明,该结果适用于各种感兴趣的计算统计和相关区域算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号